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The theory of transition between a-helix, /3-sheet and random coil conformation of a protein is 
discussed through a simple model, that includes both short and long-range interactions. Besides the 
bonding parameter and helical initiation factor in Zimm-Bragg model, three new parameters are 
introduced to describe beta structure: the local constraint factor for a single residue to be contained 
in a /3-strand, the long-range bonding parameter that accounts for the interaction between a pair 
of bonded /3-strands, and a correction factor for the initiation of a /3-sheet. Either increasing local 
constraint factor or long-range bonding parameter can cause a transition from a-helix or random 
coil conformation to /3-sheet conformation. The sharpness of transition depends on the competition 
between short and long-range interactions. Other effective factors, such as the chain length and 
temperature, are also discussed. In this model, the entropy due to different ways to group /3-strands 
into different /3-sheets gives rise to significant contribution to partition function, and makes major 
differences between beta structure and helical structure. 

PACS numbers: Valid PACS appear here 



I. INTRODUCTION 

The formation of protein secondary structure has at- 
tracted great interest in the past 50 years, due to their 
remarkably regular spatial arrangement [lj] . There are 
usually two distinct interactions involved: namely short- 
range interaction between atoms and groups which are 
near neighbors in sequence along the chain; and long- 
range interactions involving pairs of units which are re- 
mote in sequence but near in space Q. How these inter- 
actions affect the formation of secondary structure, espe- 
cially /3-sheet, is essential in the study of protein folding. 
On the other hand, a protein sequence can display struc- 
ture ambivalence and interconverts between a-helix and 
/3-sheet conformations Q ■ This phenomenon may concern 
about many well-known diseases, such as Alzheimer's, 
Mad Cow and Parkinson's disease. But the underly- 
ing mechanism is still not very clear. In this paper, we 
try to establish a unified statistical model of a-helix/ (3- 
sheet /random-coil transition in proteins, which includes 
both short-range and long-range interactions. It will be 
an extension of Zimm-Bragg's (Z-B) model to include 
beta structures as 

wellH, 

and is totally different from for- 
mer tension-induced a//3 transition theory @, 0, H B , 
which has only mentioned short-range interactions. 

a-helix and /3-sheet are most important secondary 
structure elements in protein. An a-helix is found when a 
stretch of consecutive residues all have torsion angle pair 
(4>, ip) approximately —60° and —50°, the allowed region 
in the bottom left quadrant of the Ramachandran plot. 
In a-helix, the CO group of residue n is bonded to the NH 
group of residue n + 4. Thus, it contains only short-range 
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interactions. The /3-sheet, however, is built up of several 
segments of the polypeptide chain. And long-range in- 
teractions are involved. These segments, /3-strands, are 
usually of 5 ~ 10 residues long, with (0, ip) angles in 
the broad structurally allowed region in the upper left 
quadrant of the Ramachandran plot. The /3-strands are 
aligned adjacent to each other, such that hydrogen bonds 
can form between CO groups of one strand and NH group 
of an adjacent strand, and vice versa. 

In the presence of long-range interactions, it is not suf- 
ficient to distinguish states of a protein by only iden- 
tifying the states of each amino acid. Various bond- 
ing patterns of /3-strands can result in different states. 
Therefore, the different ways of arranging /3-strands into 
/3-sheets have to be considered. In this paper, we will 
establish a model, in which two sets of codes are intro- 
duced to represent respectively the state of each residue 
and connecting pattern of the /3-strands. Accordingly, 
we are able to write down the partition function, through 
which the transition between alpha, beta and random coil 
structures of a protein is discussed in detail. 

When no a-helix is presented, our model shows that 
a coil-to-sheet transition occurs as the local constraint 
factor is increased. And the sharpness of transition is de- 
termined by the long-range bonding parameter and the 
initiation factor. The smaller these two parameters are, 
the sharper the transition will be. This result has similar 
feature as that in coil-to-helix transition, which has been 
studied extensively by Z-B model. With the increment 
of the local constraint factor, the normalized number of 
/3-strands increases firstly, and then decreases after pass- 
ing the transition point, due to the competition between 
short-range and long-range interactions. 

Helix/sheet transition is of great interest in this paper. 
In the presence of long-range interactions, the entropy 
contribution to partition function becomes more impor- 
tant. This is due to the ways of connecting /3-strands 
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into /3-sheets can be various, even when the state of each 
residue is known. Unlike enthalpy controlled helix/coil 
transition in Z-B model, helix/sheet transition is driven 
by entropy. A transition from a-helix to /3-sheet occurs, 
when either the local constraint factor of beta residues 
or the long-range bonding parameter of /3-stands is in- 
creased. The sharpness of transition depends on the 
competition between short-range and long-range inter- 
actions. Moreover, the compromise between short-range 
and long-range interactions is able to give rise to mixed 
a/ (3 structure. 

The length and temperature dependence of the transi- 
tion are also studied. Statistical study of protein struc- 
tures reported in Protein Data Bank (PDB) shows that 
the fractions of alpha and beta residues are nearly un- 
changed when the chain is sufficiently long(n > 50). 
Comparing our theoretical result with the data, we are 
able to identify the values of parameters in our model. 
Interestingly, the parameter values are close to the tran- 
sition point. This indicates that protein sequences in 
living cells are well selected by evolution, to compromise 
the required properties such as stability, flexibility and 
diversity. The temperature dependence is studied based 
on above parameter values. Our results show that at low 
temperature, a-helix is dominated because of strong local 
interactions; while at high temperature, beta structure is 
more favored due to larger entropy contribution. 

This paper will begin with an extension of Z-B model 
in Section HH where connection properties of /3-strands 
are introduced to obtain a complete description for the 
state of a protein. The partition function is obtained in 
Section HTTl by simple assumptions on the states of amino 
acid residues and /3-strands. In Section lTVl we discuss the 
transition from random coil and a-helix to /3-sheet under 
different conditions. At last, a brief conclusion is given 
in Section IVl 



II. MODEL 

This section presents a model for the polypeptide chain 
that is intended to extend Z-B model to include beta 
structures. Specifically, it establishes a second order cod- 
ing for the connection of /3-strands in /3-sheets. The par- 
tition function is then formulated as contributions from 
all a, all /3 and a//3 mixed structures. To describe the 
model in detail, we firstly have to make a complete de- 
scription of the conformation of a chain, which will be 
finished in two steps. 

The first step is to define the state of each amino 
acid residue, in other words, which one is a-helical and 
which one is in /3-strand. Definition of a-helical residue 
is straightforward, i.e., those residues whose NH group is 
bonded to the CO group of the fourth preceding residue. 
Here we have assumed that bonding of a residue, if it oc- 
curs in a-helix, is always to the fourth preceding residue, 
and disregard other helical structures such as 7r-helix or 
3in-helix[ll|. To determine whether a residue is in /3- 



strand is sticky and depends on the state of its neighbor- 
ing residues. We assume that a residue is regarded as in 
/3-strand if the angle pair (</>, tp) of itself or both of its 
neighboring residues take values in the upper left quad- 
rant of Ramachandran plot. Now, a chain of n residue 
can be described by a sequence of n symbols, each of 
which can have one of three values: digit represents a 
random coil residue, 1 for an a-helical residue, and 2 for 
a residue in /3-strand. An example is shown by the 1st 
code in Fig. [TJ 

Knowing the state of each residue is not sufficient. We 
need a 2nd code to describe how /3-strands are connected 
into /3-sheets. We assign a symbol for the j'th strand 
in the i'th /3-sheet(2nd code in Fig. [TJ. Here, the num- 
ber j is assigned not according to the sequence along the 
chain, but the spatial connection position in the /3-sheet. 
Now, the state of a chain can completely be described 
by these two sequences of codes as shown in Fig. [TJ We 
will see later that this second step makes major differ- 
ences between /3-sheet and a-helix, and gives rise to the 
helix/sheet transition. 

Finally, we introduce the statistical weight for a given 
state of a chain as the product of following factors, ac- 
cording to the coding established above: 

(1) The quantity unity for every (coil residue). 

(2) The quantity s for every 1 that follows a 1 (helical 
residue). 

(3) The quantity as for every 1 that follows [i or more 
0's (initiation of a helix). 

(4) The quantity w for every 2 (residue in a /3-strand). 

(5) The quantity p for every 5? with j > 2 (/3-strand). 

(6) The quantity rjp for every S} (initiation of a /3- 
sheet). 

(7) The quantity for one of the following cases: 

(a) every 1 that follows 2 or a number of less 
than /i; 

(b) every 2 that follows 1 or a number of less 
than yit; 

(c) every single 2 between two 0. 

The effect of assumption (7) is that the secondary struc- 
tures are separated from each other by at least coil 
residues; and each /3-strand contains at least 2 residues. 
The number fj, is varied from case to case. For example, 
two helices are separated by at least 4 residues, while two 
/3-strands can be separated by only two residues (/3-turn) . 
In this paper, we will take a uniform value, say [i = 3, 
for simplicity. We will see later that the value of [i has 
little effect on the transition, especially for long chains. 

The meaning of the statistical weights are as follows. 
The first three weights are the same as those in Z-B 
model[l2|, [l^]. The factor unity is arbitrarily assigned to 
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Peptide units: 12345 6 789 - • • 

1st code: 00000 1 11100002 2 2 2 2 2000002 2 2 2 2000002 2 2 2 2 • • • 

Weights: 11111 as sssllllwwwwwwlllllwwwwwlllllwwwwwll--- 

2nd code: S{ Sf Si 

Weights: rjp p rjp 

FIG. 1: Coding and weight of residues in a chain. 



coil residues, since only the relative ratio is effective. The 
factor s measures the contribution of a helical residue, 
relative to a coil residue, to the partition function. It 
contains a decrease due to restriction of torsion angle and 
an enhancement because of the hydrogen bonding. The 
factor a represents the decrease of weight s for the first 
unit in a helix, since the formation of first hydrogen bond 
causes restriction of the freedom of residues between the 
bonded ones. 

The next three weights are new and for beta structures. 
The factor w measures the contribution of a residue in /3- 
strand, relative to a coil residue, to the partition function. 
It contains an slight increase due to the lower energy com- 
pared to coil residues @. Note the torsion angle of beta 
residues can take values from a quite broad region from 
the upper left quadrant of the Ramachandran plot, thus 
the restriction in freedom is not serious. The factor p rep- 
resents long-range interactions between a pair of bonded 
/3-strands. This includes a decrease due to restriction of 
freedom of residues between the two bonded /3-strands, 
and an enhancement due to the hydrogen bonding. Nev- 
ertheless, since two /3-strands can be separated by long 
distance along the chain, the effect of decreasing is dom- 
inant. Therefore, the value of p is usually small. Similar 
to the case of helix, an initiation factor rj is introduced to 
measure the decrease of weight p for the first pair of (3- 
strands in a /3-sheet. In summary, for the five weights in 
the model, the factors s and w are usually sightly larger 
than unity, while a, p and rj are all less than unity. 

In the above discussion, we assume that the bond- 
ing energy(factor p) of each pair of /3-strands are the 
same, and independent of the number of both residues 
and hydrogen bonds between the bonded /3-strands. It 
is a highly simplified representation of the problem and 
enables us to write down the partition function. One 
may introduce a set of factors p(ni, 712), which depends 
on those two values, to make the model more realistic. 
But since bonding patterns of /3-sheet varies greatly, our 
present knowledge is too incomplete to justify a more 
refined model. 



III. MATHEMATICAL TREATMENT 

A formal representation of partition function Z for a 
chain of n residues can be obtained from above model 
by direct enumerating of the number of different ways of 
arranging digits 0, 1, 2 and the state of strands Sf. 

Let X = (n a , nb,h,lb> kb) be the state of a chain, where 
n a , nb, l ai kb are respectively the number of helical 



residues, beta residues, a-helices, /3-strands and /3-sheets. 
Then the statistical weight of state X is given by 



Q(X) 



>w n »a la p lb r) k » 



(1) 



Let S(X) be the number of states, which has the same 
statistical weight as X, and f2 be the phase space of pos- 
sible states, then the partition function is represented as 



z=J2 Q(x)S(x). 



(2) 



xen 



Explicitly, the phase space Q is given by 



0<fe 6 < l b /2<n b /i< Ljfy, 1 
0</ o <n o < Lif^J, 
o=^XgZ 5 : n-(n a + n b )>fi(l a + l b ) 
h = n-b = if kb = 
n a = if l a = 



The degeneracy S(X) is formulated as 

\ t n a — 1 nb — tfa — 1 tfa — — -I- 

n— n a — Tib— (ft— l)(la+h) la+h' 



(3) 



This formula is obtained by following steps. Firstly, we 
partition n a amino acids into l a a-helices, rib amino acid 
into lb /3-strands and lb /3-strands into kb /3-sheet along 
the chain. Then we insert n — n a — rib coil residues be- 
tween a-helices and /3-strands. Finally, we rearrange the 
order of a-helices and /3-strands. The meaning of each 
term is explained as follows. The binomials C^~_ x rep- 
resents the number of ways to group n a helical residues 



into l a helices. The terms C ' 



and C h-k b -x are 
similar, representing the number of ways to group beta 

residues into /3-strands and /3-strands into /3-sheets, re- 
spectively. Here each /3-sheet has at least two strands; 
and each /3-strand has at least two residues. The binomial 
C [ ^ b a _ nb _^_ l){la+lb) represents how many ways to in- 
sert n — n a —rib coil residues between the secondary struc- 
ture elements, such that consecutive helix and strand are 
separated by at least /x coil residues. The last factor 
C\ b +lb represents how many ways to arrange the order 
of a-helices and /3-strands. Note that here the a-helices 
are considered as identical. The /3-strands are assumed 
to connect to only the neighboring ones. Long-distance 
connections in /3-sheets are neglected, since the usual 
adopted arrangements are quite limited at room tempera- 
ture. One may propose other assumptions. For example, 
/3-strands are assumed to be able to connect to each other 
freely (lb\C\ b +lb ), despite their length and distance. Or 
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all a-helices and /3-strands are considered as distinguish- 
able {{l a + Zh)!), in case of very strong long-range inter- 
actions between secondary structure elements. Different 
assumptions may give different results, but the general 
property of transition is almost unchanged, according to 
our simulations. Further discussions are shown in the 
appendix. 

The partition function can be rewritten according to 
the structure of the chain as follows: 



Z = 1 + Z a + Zp 



V/8 



where 



Z a — 



Za = 



L l+jj. J n — (j,l a 

E E fC&Ckn. 

l a = l n a =l a 

L4+271J 1-2 + ^ J 

v 2^ p u h-k b -i 



-(p-l)la 



(4) 



(5) 



k b = l l b =2k. 

n— nh 

x 

7i(,=2Z(, 
L4 + 277J L 2 + fi J 



/ 1 w h— l w n— fit,— (m — 1)'* 



w nb Cl b ~ l 



E ^ E ^ 



lbr<kb — 1 
Zfa — fcfo — 1 



n— /iij, 



L 1 + n J 



x E »" 6 ^-l-i E ^ 

n-n b -p(i a +ii,) 

7la—la 



la 



X 



n -l w n-TH,-n a -(/*-l)(J»+i 



Here Zq, (or Zp) represents the partition function for th 
states of all a (or /3) structures, and Z a /p for a//3 mixe< 
structures. 

In the following, we will show how the a-helix, /3-sheet 
and random coil conformation at equilibrium state tran- 
sit to each other, when the parameters are changing. We 
define the fraction of helical and beta residues respec- 
tively as 



n 



1 dlnZ 



n d In s ' 



->b ■- 



n 



1 dlnZ 



n dhiw 



(8) 



The normalized number of a-helices and /3-strands are 
also of interest. 



n 



1 dlnZ 



It. 



1 dlnZ 



n d In a 



°b ■= — = 

n n omp 



(9) 



IV. RESULTS AND DISCUSSIONS 



From discussions in previous section, it is easy to show 
that in the absence of long-range interaction, the parti- 
tion function(Z = 1 + Z a ) is the same as that for he- 
lix/coil transition, which has been well studied by Z-B 



model. In following, we will focus on how the parameters 
induce transitions between a-helix, /3-sheet and random 
coil. From ((5])- ([7]), the parameter ji has only minor effect 
on the results. Here, we will set 11 = 3 for the minimum 
number of coil residues between secondary structures. 



A. Sheet/Coil Transition 

At first, we study the transition from random coil to 
regular /3-sheet(Z = l + Zp), which is similar to helix/coil 
transition @, [3? ]. 

In this case, We have three tuneable parameters: w for 
local constraint of single residue, p for long-range bonding 
interactions between a pair of bonded /3-strands and rj for 
initiation correction of a /3-sheet. The dependence of 9b 
and 4>b on the parameters are shown at Fig. [51 
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FIG. 2: Sheet/coil transition (n = 200). The dependence of 
8b (a) and <j>b (b) on w, with different values of p, and rj is 
set to 1.0. The dependence of 0b (a) and <j>b (d) on w, with 
different values of rj, and p is set to 0.01. 

From Fig. [2ja), [He), we can see that when w is in- 
creasing, the fraction of beta residues 9b goes from zero to 
unity monotonously. This means the random coils turn 
into regular /3-sheets when the local interactions become 
stronger. This transition usually happens at w — 1, in 
accordance with Z-B model. The sharpness of transition 
depends on p and rj. The smaller p or rj is, the sharper 
the transition will be. 

Unlike the fraction of /3 residues, which increases 
monotonously with respect to w, the number of /3-strands 
increases firstly to reach a peak at around the transi- 
tion point, and then decreases when w keeps increasing 
(Fig. [2^b),(d)). We argue that this is a consequence 
of the competition between short and long-range inter- 
actions. Generally speaking, when w is less than unity, 
short-range interactions prompt long-range interactions. 
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With the increment of w, there tends to be more (3- 
strands, as well as the strands get longer. Thus 9b and 
increase simultaneously. When w is larger than unity, 
short-range interactions suppress long-range interactions. 
As a consequence, when w is increasing further, 9b is al- 
most unchanged while <pb drops continuously. It means 
that there tends to be less /3-strands and the strands still 
grow longer and longer. 



B. Helix/Sheet Transition 

A more interesting result is a-helix//3-sheet transition. 
Since the major difference of /3-sheet from a-helix is the 
existence of long-range interaction, which is described by 
parameters p and r\. Thus not only the increase of local 
effect w, but also the increment of long-range interaction 
parameters p and rj can induce a transition from helical 
structure to beta structure(Fig. [3J. 




FIG. 3: Helix/sheet transition (n = 200). In (a)(c)(e), the 
dashed lines are for 9 a ; while the solid lines are for 6b- In 
(b)(d)(f), the dashed lines are for <fi a ; and the solid lines are for 
4>b- (a) The dependence of 9 a , Ob on w. (b) The dependence 
of (f> a , 4>b on w. (c) The dependence of 8 a , Ob on p. (d) The 
dependence of <f> a , 4>b on p. (e) The dependence of 9 a , Ob on 
p. (f) The dependence of 4> a , 4>b on p. In all computations, 
the chain length is n = 200. The parameters used are, except 
particular indicated, s = 1.15, a = 0.037, w = 1.03, p = 0.072 
and r\ = 1. 



From Fig. [3ja,b), we can see that a- helices transit 
to /3-sheets with the increment of short-range interaction 
w. When w is further increasing, the short-range inter- 
action takes over long-range interactions, and therefore 
the normalized number of /3-strands 4>b decreases. 

From Fig. EDJc)-(e), we can see that the fraction of beta 
structure is very sensitive to the long-range interactions 
(p and 77) when the parameters are small (p, rj -C 1). This 
is a consequence of large entropy and high cooperativity 
of beta structures. Yet when the long-range interactions 
are strong enough (p, 77 ~ 1), the dependence of 9 and <fi 
on the parameters are far less evident. 

In the present of long-range interactions, the entropy 
contribution to the partition function become important. 
This is originated from two sources: different methods of 
arranging secondary structure segments along the chain; 
and the ways of connecting /3-strands into /3-sheets, which 
is specified for beta structure. In protein, helical struc- 
ture tends to have lower energy due to local hydrogen 
bonds, while the beta structure has larger entropy. As 
consequence, coil/helix transition is enthalpy controlled, 
while helix/sheet transition is entropy driven. Moreover, 
the longer the chain is, the sharper the helix/sheet tran- 
sition will be. This helix/sheet transition picture may 
shed light on the mechanisms of structure transition of 
prion and other proteins Q. 



C. Length Dependence 

The length dependence of the average fractions of al- 
pha and beta residues obtained by studying the native 
structures in Protein Data Bank is plotted in Fig. 2] 
The results show that despite the variety from one pro- 
tein to another, the average fractions are roughly inde- 
pendent to the chain length, except for very short chains 
(n < 30), with about 32% alpha residues and 25% beta 
residues. The data can be fitted by our model with pa- 
rameters w = 1.03,p = 0.072, 77 = l,s = 1.15, cr = 0.037 
(Fig. 01(a)). Note that here the number of alpha residues 
from our model is given by n a = n(9 a + 30 a ). Since in 
our model, the first three residues in every helix is not 
treated as helical. The same set of parameters also gives 
good agreement for the average number of a-helices and 
/3-strands(Fig. [31b)). Accordingly, we obtain the aver- 
age length of a-helix to be 11, and /3-strand to be 6, both 
agree with experimental data. 

In previous simulation results(Fig. [31), we can see that 
while fitting our model to PDB data, the parameters take 
values around the helix/sheet transition point. We argue 
that this is not by chance, but is a prerequisite for pro- 
teins in living cells. Since diverse structure of proteins in 
living cells is required for their biological functions, par- 
ticular values of the parameters are needed to compro- 
mise the required properties such as stability, flexibility 
and diversity, by natural selection. 
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FIG. 4: Experimental data and theoretical simulation for the 
length dependence. The data are fitted by the model with 
parameters w = 1.03, p = 0.072, rj = 1,8= 1.15, a = 0.037. 
(a) The average number of alpha and beta residues, (b) The 
average number of Q-helices and /3-strands. 



D. Temperature Dependence 

From previous discussion, helical structure tends to 
have low enthalpy, and beta structure tends to have large 
entropy. Thus, under certain condition, we should have a 
transition from helical structure to beta structure when 
the temperature is increased. To study this temperature 
induced transition, we need to explore the dependence 
of parameters on temperature. For the parameter s, for 
instance, we have[l5| 

s = exp(AG/fc B T) 

where T is the temperature. Here AG is the free energy 
change by converting one residue from random coil state 
to helical state. Assume that AG is independent to the 
temperature. Then the parameters for corresponding 



temperatures Tj (i = 0, 1) are related by 

si = s£ 

where r = Tq /Tj is the ratio of original and final temper- 
ature. Let T — 300K be the room temperature, thus we 
can investigate the temperature induced structure tran- 
sition by changing the system parameters simultaneously 

f \ I r r r r r\ 

{a,s,r],w,p) -> {a ,s ,r] ,w ,p ) 
with r = T /Tq. The simulation results are shown in Fig. 
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FIG. 5: Temperature dependence of fractions of alpha and 
beta resides for both short (n = 20) and long (n = 200) 
chains. Here T = 300if, w = 1.03, p = 0.072,7? = 1, s = 
1.15,o- = 0.037. 



The transition from alpha to beta structure, when the 
temperature is increasing, is obvious in our simulation. 
For long chains, at low temperature, a-helix is dominated 
because of strong local interactions; while at high tem- 
perature, beta structure is more favored for larger en- 
tropy contribution. But for short chains (n <= 26 in 
this case), a-helix becomes unstable at low temperature, 
and 9 a drops to zero. A big drawback of the results in 
Fig. O is that the regular secondary structures will not 
break into random coils at high temperature, by which 
the proteins should unfold. This is mainly due to our 
strong assumption that AH does not change with the 
temperature, which is not true when the temperature is 
far from the room temperature. 



V. CONCLUSION 

In this paper, we extend the Z-B model to include beta 
structures, by introducing two sets of codes which rep- 
resent respectively the state of each amino acid residue 
and connecting pattern of the /3-strands. In additional 
to the bonding parameter and helical initiation factor in 
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Z-B model, three new parameters are introduced to de- 
scribe beta structures: the local constraint factor for a 
single residue to be contained in a /3-strand, the long- 
range bonding parameter that accounts for the interac- 
tion between a pair of bonded /3-strands, and a correction 
factor for the initiation of a /3-sheet. Then the partition 
function is obtained based on our model, which contains 
both short and long-range interactions. 

Through numerical study, the transition from random 
coil and a-helical conformation to /3-sheet structure are 
discussed respectively. In common, either the increase 
of the short-range or the long-range interactions of beta 
structure can cause a transition. And the sharpness of 
transition is mainly determined by the long-range bond- 
ing parameter and the initiation factor. However, the 
coil/sheet transition is enthalpy controlled, while he- 
lix/sheet transition is entropy driven. Other effective 
factors, such as the chain length and temperature, are 
also shown. The PDB data for protein chains with dif- 
ferent length are fitted by our model and show a fairly 
well agreement. In this way, we can identify the values of 
parameters in our model. Interestingly, these values are 
close to the transition point. This indicates that protein 
sequences in living cells are well selected by evolution, 
to compromise the required properties such as stability, 
flexibility and diversity. At last, by considering the re- 
lationship between statistical weights and free energy, a 
temperature induced helix/sheet transition is observed. 
Our results show that at low temperature, a-helix is dom- 
inated because of strong local interactions; while at high 
temperature, beta structure is more favored due to large 
entropy contribution. 

We hope our results may shed light on the mechanisms 
of structure transition in prion and other proteins, as well 
as the understanding of short and long-range interactions 
in the formation of secondary structures. 
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APPENDIX A 

The exact factor for different ways of arranging a- 
helices and /3-strands is undetermined in our present 
model, since our current knowledge about the secondary 
structure arrangement in natural proteins is too incom- 



plete to justify the models. In general, we have three ex- 
treme cases. If the connections are highly restricted, i.e., 
/3-strands are assumed to connect to only the neighboring 
ones, the factor C\ b +l would be a good estimate (Case 
Ai). On the other extreme, if the /3-strands can connect 
to each other freely, despite their long distance along the 
chain, the factor should be lb\C\ b +lb (Case A2). But this 
can only happen at very high temperature. Moreover, 
we can even assume that all a-helices and /3-strands are 
distinguishable, in case of very strong long-range inter- 
actions between secondary structure elements. Then the 
factor will be (l a + 4)! (Case A3). Different assumption 
may give different results, but the general properties of 
transition are almost unchanged, which is shown in Fig. 
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FIG. 6: Comparison of helix/sheet transition under three 
different assumptions. Case Ai with s — 1.15, a = 0.037, p = 
0.072 and 77 = 1. Case A 2 with s = 1.14, a = 0.04, p = 0.0062 
and 77 = 1. Case A 3 with s = 1.14, a = 0.0047, p = 0.007 and 
77 = 1. The values of parameters in case Ai(i = 1,2,3) are 
the same as in Fig. fTJ respectively. 



In this paper, we prefer the factor C^^. Since at 
room temperature, /3-strands can not joint freely; and 
the connecting patterns are quite limited. Moreover, if we 
compare case At{i = 1, 2, 3) with PDB data, we can see 
case A\ gives more reasonable result (see Fig. 0}. For case 
A2 and A3, the number of helical residues n a decreases 
with the increment of length n, when the chain is long 
enough (Fig. [7]). This is obviously contradictory to the 
experimental data. 
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